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Abstract 

We present a framework for approximating the metric TSP based on a novel use of matchings. 
TraditionaUy, matchings have been used to add edges in order to make a given graph Eulerian, 
whereas our approach also allows for the removal of certain edges leading to a decreased cost. 

For the TSP on graphic metrics (graph-TSP), the approach yields a 1.461-approximation 
algorithm with respect to the Held-Karp lower bound. For graph-TSP restricted to a class of 
graphs that contains degree three bounded and claw-free graphs, we show that the integrality 
gap of the Held-Karp relaxation matches the conjectured ratio 4/3. The framework allows 
for generalizations in a natural way and also leads to a 1.586-approximation algorithm for 
the traveling salesman path problem on graphic metrics where the start and end vertices are 
prespecified. 

1 Introduction 

The traveling salesman problem in metric graphs is one of most fundamental NP-hard optimization 
problems. In spite of a vast amount of research several important questions remain open. While the 
problem is known to be APX-hard and NP-hard to approximate with a ratio better than 220/219 
|2(Jj . the best upper bound is still the 1.5- approximation algorithm obtained by Christofides |3] 
more than three decades ago. A promising direction to improve this approximation guarantee, has 
long been to understand the power of a linear program known as the Held-Karp relaxation |14] . 
On the one hand, the best lower bound on its integrality gap (for the symmetric case) is 4/3 and 
indeed conjectured to be tight [Tl]. On the other hand, the best known analysis [22l |23] is based 
on Christofides' algorithm and gives an upper bound on the integrality gap of 1.5. 

In the light of this difficulty of even determining the integrality gap of the Held-Karp relaxation, 
a reasonable way to approach the metric TSP is to restrict the set of feasible inputs. One promising 
candidate is the graph-TSP, that is, the traveling salesman problem where distances between cities 
are given by any graphic metric, i.e., the distance between two cities is the length of the shortest 
path in a given (unweighted) graph. Equivalently, graph-TSP can be formulated as the problem 
of finding an Eulerian multigraph within an unweighted input graph so as to minimize the number 
of edges. In contrast to TSP on Euclidean metrics that admits a PTAS [H |T7], the graph-TSP 
seems to capture the difficulty of the metric TSP in the sense that, as stated in |12] . it is APX-hard 
and the lower bound 4/3 on the integrality gap of the Held-Karp relaxation is established using a 
graph-TSP instance. 

The TSP on graphic metrics has recently drawn considerable attention. In 2005, Gamarnik 
et al. [8j showed that for cubic 3-edge-connected graphs, there is an approximation algorithm 
achieving an approximation ratio of 1.5 — 5/389. This result was generalized to cubic graphs by 

*This research was supported by ERG Advanced investigator grant 226203. 



1 



Boyd et al. 0, who obtained an improved performance guarantee of 4/3. For subcubic graphs, 
i.e., graphs of degree at most 3, they also gave an 7/5-approximation algorithm with respect to 
the Held-Karp lower bound. In a major achievement, Gharan et al. [9J recently presented an 
approximation algorithm for graph-TSP with performance guarantee strictly better than 1.5. The 
approach in [9] is similar to that of Christofides in the sense that they start with a spanning tree 
and then add a perfect matching of those vertices of odd-degree to make the graph Eulerian. The 
main difference is that instead of starting with a minimum spanning tree, their approach uses the 
solution of the Held-Karp relaxation to sample a spanning tree. Although the proposed algorithm 
in [9] is surprisingly simple, the analysis is technically involved and several novel ideas are needed 
to obtain the improved performance guarantee 1.5 — e for an e of the order 10"^'^. 

Our Results and Overview of Techniques. We propose an alternative framework for approx- 
imating the metric TSP and use it to obtain an improved approximation algorithm for graph-TSP. 



Theorem 1.1 There is a polynomial time approximation algorithm for graph-TSP with perfor- 
mance guarantee < 1.461. 

The result implies an upper bound on the integrality gap of the Held-Karp relaxation for graph- 
TSP that matches the approximation ratio. For the restricted class of graphs, where each block 
(i.e., each maximally 2- vertex-connected subgraph) is either claw- free or of degree at most 3, we 
use the framework to construct a polynomial time 4/3-approximation algorithm showing that the 
conjectured integrality gap of the Held-Karp relaxation is tight for those graphs. Li fact, the 
techniques allow us to prove the tight result that any 2- vertex-connected graph of degree at most 
3 has a spanning Eulerian multigraph with at most 4n/3 — 2/3 edges, which settles a conjecture of 
Boyd et al. [3j affirmatively. 

Our framework is based on earlier works by Frederickson & Ja'ja' [7J and Monma et al. |18j . 
who related the cost of an optimal tour to the size of a minimum 2-vertex-connected subgraph. 
More specifically, Monma et al. showed that a 2-vertex-connected graph G = {V, E) always has a 
spanning Eulerian multigraph with at most ^\E\ edges, generalizing a previous result of Frederickson 
& Ja'ja' who obtained the same result for the special case of planar 2-vertex-connected graphs. One 
interpretation of their approaches is the following. Given a 2-vertex-connected graph G = {V^E), 
they show how to pick a random subset M of edges satisfying: (i) an edge is in M with probability 
1 /3 and (ii) the multigraph H with vertex set V and edge set E [J M \s spanning and Eulerian. 
From property [i) of M, the expected number of edges in H is yielding their result. 

Although the factor 4/3 is asymptotically tight for some classes of graphs (one example is the 
family of integrality gap instances for the Held-Karp relaxation described in Section [2]) , the bound 
rapidly gets worse for 2-vertex-connected graphs with significantly more than n edges. The novel 
idea to overcome this issue is the following. Instead of adding all the edges in M to G, some of 
the edges in M might instead be removed from G to form H. As long as the removal of the edges 
does not disconnect the graph, this will again result in a spanning Eulerian multigraph H. To 
specify a subset R of edges that safely may be removed we introduce, in Section [3j the notion of 



a "removable pairing". The framework is then completed by Theorem 3.2, where we show that 



a 2-vertex-connected graph G = {V, E) with a set R of removable edges has a spanning Eulerian 
multigraph with at most ^\E\ — \ \R\ edges. 

In order to use the framework, one of the main challenges is to find a sufficiently large set of 
removable edges. In Section [4j we show that this problem can be reduced to that of finding a 
min-cost circulation in a certain circulation network. To analyze the circulation network we then 
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(in Section [5]) use several properties of an extreme point solution to the Held-Karp relaxation to 
obtain our main algorithmic result. The better approximation guarantees for special graph classes 
follows from that the circulation network has an easier structure in these cases, which in turn allows 
for a better analysis. 

Finally, we note that the techniques generalize in a natural way. Our results can be adapted to 
the more general traveling salesman path problem (graph-TSPP) with prespecified start and end 
vertices to improve on the approximation ratio of 5/3 by Hoogeveen [15J when considering graphic 
metrics. More specifically, we obtain the following. 

Theorem 1.2 For any e > 0, there is a polynomial time approximation algorithm for graph-TSPP 
with performance guarantee 3 — \/2 + e < 1.586 + e. 

If furthermore each block of the given graph is degree three bounded, there is a polynomial time 
approximation algorithm for graph-TSPP with performance guarantee 1.5 + e, for any e > 0. 

The generalization to the traveling salesman problem is presented in Section [6j 

2 Preliminaries 

Held-Karp Relaxation. The linear program known as the Held-Karp (or subtour elimination) 
relaxation is a well studied lower bound on the value of an optimal tour. It has a variable x^^^^^j 
for each pair of vertices with the intuitive meaning that „} should take value 1 if the edge 
{u, v} is used in the tour and otherwise. Letting G = (V, E) be the complete graph on the set of 
vertices and c^u,v} be the distance between vertices u and v, the Held-Karp relaxation can then be 
formulated as the linear program where we wish to minimize X^eeE (^eXe subject to 

x{6{v)) = 2ioTveV, x{6{S)) > 2 for / 5 C y, and x > 0, 

where 6{S) denotes the set of edges crossing the cut (5, S) and x{F) = X^gg^ Xe for any FOE. 

Goemans &: Bertsimas [1U| proved that for metric distances the above linear program has the 
same optimal value as the linear program obtained by dropping the equality constraints. Moreover, 
when considering a graph-TSP instance G = {V, E) we only need to consider the variables (xe)ee-B- 
Indeed, any solution x to the Held-Karp relaxation without equality constraints such that x^^^^y > 
for a pair of vertices {u, v} ^ E can be transformed into a solution x' with no worse cost and 
^'{u t,} = by setting Xg = Xg + x^u^^y for each edge on the shortest path between u and v, and 
Xg = Xe for the other edges. The Held-Karp relaxation for graph-TSP on a graph G = (V, E) can 
thus be formulated as follows: 

min Xe subject to x{5{S)) > 2 for ^ ^ S C V, and x > 0. 

ee_E 

We shall refer to this linear program as LP{G) and denote the value of an optimal solution by 
OPTlp{G). Its integrality gap was previously known to be at most 3/2 — e and at least 4/3 for 
graphic instances. The lower bound is obtained by a claw-free graphic instance of degree at most 

3 that consists of three paths of equal length with endpoints (si,ti), (527^2)1 and (53,^3) that are 
connected so as {51,52,53} and {ti,t2,t^} form two triangles (see Figurejs]). 

We end our discussion of LP{G) with a useful observation. When considering graph-TSP, it 
is intuitively clear that we can restrict ourselves to 2-vertex- connected graphs, i.e., graphs that 
stay connected after deleting a single vertex. Indeed, if we consider a graph with a vertex v 
whose removal results in components Ci, . . . ,Ce with i > 1 then we can recursively solve the 
graph-TSP problem on the £ subgraphs Gi, G2, . . . , induced by Ci U {v}, C2 U {v}, . . . , U {v}. 
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The union of these solutions wiU then provide a solution to the original graph that preserves 
the approximation guarantee with respect to the linear programming relaxation since one can see 
that OPTlp{G) > X]i=i OPTLp{Gi). We summarize this observation in the following lemma (see 



Appendix B.l for a fullproof). 



Lemma 2.1 Let G he a connected graph. If there is an r- approximation algorithm for graph- 
TSP on each 2-vertex-connected subgraph H of G (with respect to OPTlp{H)) then there is an 
r -approximation algorithm for graph-TSP on G (with respect to OPTlp(G)). 

Matchings of Cubic 2-Edge-Connected Graphs. Edmonds ^ showed that the following set 
of equalities and inequalities on the variables {xe)e&E determines the perfect matching polytope 
(i. e., all extreme points of the polytope are integral and correspond to perfect matchings) of a given 
graph G = {V,E): 

x{5{v)) = lioTveV, x{5{S)) > 1 for 5 C y with |5| odd, and x > 0. 

The linear description is useful for understanding the structure of the perfect matchings. For 
example, Naddef and Pulleyblank [TH] proved that Xf, = 1/3 defines a feasible solution when G is 
cubic and 2-edge connected, i.e., every vertex has degree 3 and the graph stays connected after 
the removal of an edge. They used that result to deduce that such graphs always have a perfect 
matching of weight at least 1/3 of the total weight of the edges. 

Standard algorithmic versions of Caratheodory's theorem (see e.g. Theorem 6.5.11 in [13]) say 
that, in polynomial time, we can decompose a feasible solution to the perfect matching polytope 
into a convex combination of polynomially many perfect matchings (see also [2j for a combinatorial 
approach for the matching polytope) . Combining these results leads to the following lemma (see |3l 
m [18] for closely related variants that also have been useful for the graph-TSP problem) . 

Lemma 2.2 Given a cubic 2 -edge- connected graph G, we can in polynomial time find a distribu- 
tion over polynomially many perfect matchings so that with probability 1/3 an edge is in a perfect 
matching picked from this distribution. 

Note that all 2-vertex-connected graphs except the trivial graph on 2 vertices are 2-edge connected. 
We can therefore apply the above lemma to cubic 2-vertex-connected graphs. 

3 Approximation Framework 



Lemma |2.1| says that the technical difficulty in approximating the graph-TSP problem lies in ap- 
proximating those instances that are 2-vertex connected. As alluded to in the introduction, we 
shall generalize previous results [3 [18] that relate the cost of an optimal tour to the size of a 
minimum 2-vertex-connected subgraph. The main difference is the use of matchings. Traditionally, 
matchings have been used to add edges to make a given graph Eulerian whereas our framework 
offers a structured way to specify a set of edges that safely may be removed leading to a lower cost. 
To identify the set of edges that may be removed we use the following definition. 

Definition 3.1 (Removable pairing of edges) Given a 2-vertex-connected graph G we call a 
tuple {R, P) consisting of a subset R of removable edges and a subset P R x R of pairs of edges 
a removable pairing if 

• an edge is in at most one pair; 

• the edges in a pair are incident to a common vertex of degree at least 3; 
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Figure 1: Examples of the used gadgets to obtain a cubic graph. 



• any graph obtained by deleting removable edges so that at most one edge in each pair is deleted 
stays connected. 

The fohowing theorem generahzes the corresponding result of [18J (their result follows from the 
the special case of an empty removable pairing). 

Theorem 3.2 Given a 2-vertex- connected graph G = {V, E) with a removable pairing {R, P), there 
is a polynomial time algorithm that returns a spanning Eulerian multigraph in G with at most 
i ■ \E\ — ^ ■ \R\ edges. 



The proof of the theorem is presented after the following lemma on which it is based. 

Lemma 3.3 Given a 2-vertex- connected graph G = {V, E) with a removable pairing (R, P), we can 
in polynomial time find a distribution over polynomially many subsets of edges such that a random 
subset M from this distribution satisfies: 

(a) each edge is in M with probability 1/3; 

(b) at most one edge in each pair is in M ; and 

(c) each vertex has an even degree in the multigraph with edge set EL) M. 



Proof. We shall use Lemma 2.2 and will therefore need a cubic 2-edge-connected graph. In the 
spirit of |7| 
graph G' 



we replace all vertices of G that are not of degree three by gadgets to obtain a cubic 
{V',E') as follows (see also Figure [l]): 



• A vertex v of degree 2 with neighbors u and w is replaced by a cycle consisting of four vertices 
fAT, vw, vs, ve with the chord {v]v,ve}. The gadget is then connected to the neighbours of 
V by the the edges {u,vn} and {vs,w}. 

• A vertex v with d{v) > 3 is replaced by a tree Ty that has [d{v)/2\ leaves, a binary root 
if d(v) is odd, and otherwise only degree 3 internal vertices. Each leaf is connected to two 
neighbours of v such that the edges incident to v that form a pair in P are incident to the 
same leaf. If d{v) is odd, one of the neighbors is left and connected to the binary root. 

The above gadgets guarantee that the graph G' is cubic and it is 2-vertex connected since G was 



assumed to be 2-vertex connected. We can therefore apply Lemma 2.2 in order to obtain a random 



perfect matching M' . Each edge of G' is in M' with a probability of exactly 1/3. Let M be the set 
of edges obtained by restricting M' to the edges of G in the obvious way. Now M contains each 
edge of G with probability 1/3. We complete the proof by showing that M also satisfies properties 
(b) and (c). As each pair of edges in P is incident to a vertex of degree at least 3, we have, by the 
construction of the gadgets, that they are incident to a common vertex in G' and hence at most 
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one edge of each pair is in M. Finally, property (c) follows from that E' U M' is clearly a spanning 
Eulerian multigraph of G' and compressing a set of even-degree vertices results in one vertex of 
even degree. □ 

Equipped with the above lemma we are now ready to prove the main result of this section. 
Proof of Theorem 13. 2i Pick a random subset M ^ E oi edges that satisfies the properties of 



Lemma 3.3 Let Mr be the set of those edges of M that are removable and let Mr be the set of 
the remaining edges of M. 

Consider the multigraph H on vertex set V and edge set E \ Mr U Mi?,. Observe that both 
adding an edge and removing an edge swaps the parity of the degree of an incident vertex. We 



have thus from property (c) of Lemma 3.3 that the degree of each vertex in H is even. Moreover, 



as {R, P) is a removable pairing, property (6) of Lemma 3.3 gives that H is connected. AUtogether 
we have that H is an Eulerian graph, i.e., a graph-TSP solution. We continue to calculate its 
expected number of edges, which is 

E[\E\ + \MR\-\Mn\]. (1) 

Using that each edge is in M with probability 1/3, we have, by linearity of expectation, that ([T]) 
equals 

\E\ + ^-i\E\-\R\)-l\R\ = ^.\E\-'^-.\R\. 

To conclude the proof, we note that the selection of M can be derandomized since there are, 
by Lemma |3.3[ polynomially many edge subsets to choose from; taking the one that minimizes the 
number of edges of H is sufficient. □ 



4 Finding a Removable Pairing by Minimum Cost Circulation 

In order to use our framework, one of the main challenges is to find a removable pairing that is 
sufficiently large. In the following, we show how to obtain a useful removable pairing based on 
circulations. 

Consider a 2- vertex connected graph G and let T be a spanning tree of G obtained by depth-first 
search (starting from some arbitrary root r). Then each edge in G connects a vertex to either one 
of its predecessors or one of its successors. We call the edges in T tree-edges and those in G but 
not in T back-edges. 

We shall now define a circulation network G{G,T). We start by introducing an orientation of 
G: all tree-edges become tree-arcs directed from the root to the leaves and all back-edges become 
back-arcs directed towards the root. To distinguish the circulation network and the original graphs, 
we use the names ^ and ^ for the network versions of G and T. In order to ensure connectivity 
properties of subnetworks obtained from feasible circulations, we replace some of the vertices by 
gadgets. 

For each vertex v except the root that has £ children wi,W2, . . . ,W£ in the tree, we introduce 
£ new vertices vi, V2, vi and replace the tree-arc {v,Wj) by the tree-arcs {v,Vj) and {vj,Wj) 
for j = 1,2, . . . Then we redirect all incoming back-arcs of v from the subtree rooted by wj to 
Vj . For an illustration of the gadget see Figure [2] and for an example of a complete network see 
Figure [7j This way, all back- arcs start in old vertices and lead to new vertices or the root. In the 
following, we call the new vertices and the root in-vertices and the remaining old ones out-vertices. 
We also let I be the set of all in-vertices. 
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Figure 2: The gadget that, for each child of v, introduces a new vertex (depicted in white) and 
redirects back-arcs. 

We now specify a lower bound (demand) and an upper bound (capacity) on the circulation. 
For each arc a in 7^, we set the demand of a to 1 and for all other arcs to 0. The capacity is 
oo for any arc. Finally, the cost of a circulation / in C{G,T) is the piecewise linear function 
^^gj max[/(i?(?7)) — 1, 0], where B(v) is the set of incoming back-arcs of v. One can think of the 
cost as the total circulation on the back-arcs except that each in- vertex accepts a circulation of 1 for 
free. Note that algorithmically there is no considerable difference whether we use our cost function 
or define a linear cost function on the arcs: for any in- vertex v we can redirect all back- arcs of v to 
a new vertex v' and introduce two arcs {v' , v), one of cost and capacity 1 and the other of cost 1 
and capacity oo. All remaining arcs then have a cost of 0. 

The following lemma shows how to use a circulation in C{G,T) to approximate graph-TSP. 

Lemma 4.1 Given a 2-vertex connected graph G and a depth first search tree T of G let G* be the 
minimum cost circulation to C{G,T) of cost c{G*). Then there is a spanning Eulerian multigraph 
G' in G with at most |n + |c(C*) — 2/3 edges. 

Proof. We first note that, for any arc of C(G, T), the demand and the capacity is integral. There- 
fore, applying Hoffman's circulation theorem (see [21j . Corollary 12.2a), we can assume the circula- 
tion G* to be integral. Let G*{G, T) be the support of C* in C(G, T), i. e., the induced subgraph of 
the arcs with non-zero circulation in C*, and let G' be the subgraph of G obtained from G*{G, T) 
by compressing the gadges of the circulation network in the obvious way. 

To prove the lemma, we shall first prove that graph G' is 2-vertex connected and then define a 



removable pairing (i?, P) on G' in order to apply Theorem 3.2 That G' is 2-vertex connected follows 
from flow conservation, that each arc a in 7^ has demand 1, and the design of the gadgets. Indeed, 
if G' would have a cut vertex v with children wi,W2, ■ ■ ■ va. T then one of the subtrees, say the 
one rooted by Wj, has no back-edges to the ancestors of v which in turn, by flow conservation, 
would contradict that the tree-arc {v,Vj) in 7^ carries a flow of at least 1. (Recall that the edge 
{v,Wj} in T is replaced by tree-arcs {v,Vj) and {vj,Wj) in 7^.) 

We now determine a removable pairing {R,P) on G' . For ease of argumentation we shall first 
slightly abuse notation and define a removable pairing {Rc,Pc) on G*{G,T). The set Pc consists 
of all (e, e') such that e = {u, v) is a back-arc of cost zero in C*{G, T), v has at least two incoming 
arcs, and e' = {v, w) is a tree-arc. Note that each such v is an in-vertex, the number of incoming 
back-arcs of cost zero is at most one, e' is the unique outgoing tree-arc of v, and the only possible 
vertex v with only one incoming back-arc and no other incoming arc is the root. The set Rc 
contains all edges from Pc and additionally all remaining back-arcs of C*{G,T). In other words, 
each edge of G*{G, T) that is neither in 7^ nor in P is a back-arc with integer non-zero cost in the 
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circulation or a back-arc to the root. Hence, \Rc\ — 2|-Pc'| = if the root has more than one 

incoming back-arc and \Rc\ — 2|-Pc| = c(C*) + 1 otherwise. 

The removable pairing {R,P) on G' is now obtained from {Rc,Pc)^ by mereley compressing 
the gadgets used to form C(G, T) and by dropping the orientations of the arcs. As all edges in Rq 
are either back-arcs or they are tree-arcs starting from an in-vertex, no arc in Rc is removed by 
the compression and thus \R\ = \Rc\ and \P\ = \Pc\- Moreover, G' has (n — 1) + \R\ — \P\ edges 
and, assuming {R,P) is a valid removable pairing. Theorem 3.2 yields that G' (and thus G) has a 
spanning Eulerian multigraph with at most |((n — 1) + |i2| — |P|) — ||-R| = |n+ — 2|P|) — | < 
|n -|- |c(C*) — I edges. The last inequality followed from that — 2\P\ is at most c{C*) + 1. 

Therefore, we can conclude the proof by showing that {R, P) is a valid removable pairing. It 
is easy to verify that {R, P) satisfies the first two conditions of Definition 3.1, that is, each edge 
is contained in at most one pair and the edges in each pair are incident to one common vertex of 
degree at least three. The third condition follows from that, for any vertex v of G' , the vertices 
in the subtree of T rooted by v form a connected subgraph of G' even after removing edges 
according to {R,P). To see this we do a simple induction on the depth of In the base case, 
f is a leaf and the statement is clearly true. For the inductive step, consider a vertex v with i 
children 101,1x12, Wi in T . By the inductive hypothesis, the vertices in T^^,^. for j = 1,2,...,^ 
stay connected after the removal of edges according to {R,P). To complete the inductive step it 
is thus sufficient to verify that v is connected to each T^^. after the removal of edges. If {v,Wj} is 
not in R this clearly holds. Otherwise if ej = {v, wj} € R then by the definition of {R, P) there is 
an edge e such that (e, ej) £ P and e is incident to v and a vertex in T^„^. . Since at most one edge 
in each pair is removed we have that v also stays connected to T^. in this case, which completes 
the inductive step. We have thus proved that {R, P) satisfies the properties of a removable pairing 
which completes the proof of the statement. 

□ 



5 Improved Approximation Algorithms 

We first show how to apply our framework to restricted graph classes for which we obtain a tight 
bound on the integrality gap of the Held-Karp relaxation. We then show how to use our framework 
to obtain an improved approximation algorithm for general graphs. 

5.1 Bounded Degree and Claw- Free Graphs 

We consider the class of graphs that have a degree bounded by three. 

Lemma 5.1 Given a 2-vertex-connected graph G with n vertices, there is a polynomial time algo- 
rithm that computes a spanning Eulerian multigraph H in G with at most 4n/3 — 2/3 edges. 

Proof. If G has one or two vertices, we obtain an Eulerian multigraph of zero or two edges. 
Otherwise, we compute a depth- first search tree T in G and determine the circulation network 
C{G,T). We now show that this network has a feasible circulation / of cost at most one. Let 
us assign a circulation of one to each back-arc e in C{G,T) and push it through the path in 7^ 
that is incident to both the start and end vertex of e. By the construction of G{G,T) and from 
the assumption that G is 2-vertex connected, each tree-arc is in a directed cycle that contains 
exactly one back-arc. Therefore, all demand constraints are satisfied. Due to the degree-bounds, 
no vertex but the root has more than one incoming back-arc. The cost ^^gj max[/(i?(f )) — 1,0] 
of the circulation is therefore at most one and zero if the root has only one back-arc. If the 
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circulation cost is zero, by Lemma 4.1 we obtain a spanning Eulerian multigraph H in G with at 



most 4n/3 — 2/3 edges. For those circulations where the cost is one, the proof of Lemma 4.1 allows 
to save an additional constant of 2/3 (since then the root has more than one incoming back-arc) 
and we obtain the same bound on the number of edges. 

□ 

Note that it is sufficient to find a 2-vertex-connected degree three bounded spanning subgraph (a 



3-trestle) and thus, using a result from [16j, we can apply Lemma 5.1 also to claw- free graphs 



Applying Lemma 2.1 we obtain an upper bound of 4/3 on the integrality gap for the Held-Karp 
relaxation for the considered class of graphs. In addition, along the lines of the proof of Lemma [2. 1[ 
one can see that the above arguments imply that any connected graph G decomposed into k blocks, 
i. e., maximal 2-connected subgraphs, such that each block is either degree three bounded or claw- 
free, has a spanning Eulerian multigraph with at most 4n/3 + 2A;/3 — 4/3 edges. 

5.2 General Graphs 

We now apply our framework to graphs without degree constraints. We start with an algorithm 
that achieves an approximation ratio better than 3/2 for graphs for which the linear programming 
relaxation has a value close to n. Let G = {V, E) be an n-vertex graph. The support E' = 
{e : X* > 0} of an extreme point x* of LP(G) is known to contain at most 2n — 1 edges (see 
Theorem 4.9 in |5|). Moreover, if we let x* be an optimal solution, then any r-approximate solution 
to graph G' = {V,E') with respect to OPTlp{G') is an r-approximate solution to G with respect 
to OPTlp{G), because E' C E and OPTlp{G') = OPTlp{G). We can thus restrict ourselves to 
n-vertex graphs with at most 2n — 1 edges and, by Lemma |2.1[ we can further assume the graph 
to be 2-vertex connected. 

Algorithm 1 

Input: A 2-vertex-connected graph G with n vertices and at most 2n — 1 edges. 
1: Obtain an optimal solution x* to LP{G). 

2: Obtain a depth-first-search tree T of G by starting at some root and in each iteration pick, 

among the possible edges, the edge e with maximum x*. 
3: Solve the min cost circulation problem G{G,T) to obtain a circulation G* with cost c(C*). 



Apply Lemma 4.1 to find a spanning Eulerian multigraph with less than |n + |c(C*) edges. 



To analyze the approximation ratio achieved by Algorithm [T| we bound the cost of the circula- 
tion. 

Lemma 5.2 We have c{G*) < 6(1 - V2)n + (4\/2 - 3)0PTlp{G). 

Proof. For notational convenience, when considering an arc a in the flow network we shall slightly 
abuse notation and use x* to denote the value of the corresponding edge in G according to the 
optimal LP-solution x* . We prove the statement by defining a fractional circulation / of cost 
at most 6(1 - \/2)n + (4^2 - 3)0PTlp{G). The circulation / will in turn be the sum of two 
circulations /' and /". We obtain the circulation /' as follows: for each back-arc a we push a 
flow of size min[x*, 1] along the cycle formed by o and the tree-arcs in T^. We shall now define 
the circulation /" so as to guarantee that / forms a feasible circulation, i. e., one that satisfies the 
demands /a > 1 for each a G 1^. As out- and in- vertices are alternating in T and in- vertices have 
only one child in 7^ and no outgoing back-edges, a sufficient condition for / to be feasible can be 
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Figure 3: An illustration of Equality ([2]) with = {wi,W2, ■ ■ ■ ,W£}: both the left-hand-side and 
the right-hand-side of the equality express two times the value of the fat edges. 

seen to be /a > 1 for each a G 7^ that is from an out-vertex to an in-vertex. To ensure this, we now 
define /" as follows. For each vertex f of G that is replaced by a gadget consisting of an out-vertex 
V and a set Zy of in- vertices, we push for each w £ ly a flow of size max[l — f'^^ yjy^] along a cycle 
that includes the arc {v,w) (and one back-arc). Note that such a cycle is guaranteed to exist since 
G was assumed to be 2- vertex connected. From the definition of /", we have thus that f = f + f" 
defines a feasible circulation. 

We proceed by analyzing the cost of /, i.e., ^^gjmax[/(i?(i;)) — 1,0], where I is the set of 
all in- vertices and B{v) is the set of incoming back-arcs of w G X. Note that the cost is upper 
bounded by ^^gj max[/'(i?(T;)) — 1,0] -|- X^,„gi /"(-B(t')) and we can thus analyze these two terms 
separately. We start by bounding the second summation and then continue with the first one. If 
OPTlp[G) = n then one can see that /" = 0. Moreover, 



Claim 5.3 We have J2yex fi^iv)) < OPTlp{G) 



n. 



Proof of Claim. When considering a vertex v as done above in the definition of /", the flow pushed 
on back-arcs is X]u,ex„ ~ f'(v,w)^^^ ^^^^^^ equals Y.weX'S'^ ~ f{v,w))' ^^^^^ Ty = {w £ ly : 

f[v w) ^ Letting Tyy be the set of vertices of G in the subtree of the undirected tree T rooted 
by the child of G X(,, we have, by the definition of /', 

4,-) = E ^^i^K' 1] = ^*(^iTy,) \ 5{V)). 



w Ty. We have thus Y.y,^x'S^ ~ f[v,w)) = l^^l ~ S«>ex; x*{5{Tu,) \ 5{v)). As we are considering a 



The second equality follows from that if x* > 1 for some a G (5(T^) \ 8{v) then /^'^ ^-j > 1 and hence 

_ fl ^ _ IT' I _ \ \ Ao 

depth- first-search tree (see Figure [3|, 

2 ^ x*{6{T^)\5iv))= x*{S{T^)) + x* U | U ^ M 1 1 -x*{5{v)). (2) 



Since by the feasibility of x* each of the sets corresponds to a cut of fractional value at least 2 we 
use 2 • {\Iy \ + 1) — x*{6{v)) as a lower bound on Q. 
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Summarizing the above calculations yields 

Repeating this argument for each v we have T^vex f" i^i^)) = T.vevT.wex^ (l " f{v,w)) - 
J2vev (2^^^ - 1), which equals OPTlp{G) - n since OPTlp{G) = \ E.ey ^*(^(^'))- □ 



We proceed by bounding X]i;ex ™^^[/'(-^(^)) ~ ^'0] from above. 

Claim 5.4 We have J2vex max[/'(S(^;)) - 1, 0] < (7 - 6^/2)n + 4(^/2 - 1)OPTlp{G) 

Proof of Claim. To analyze this expression we shall use two facts. First G has at most 2n — 1 
edges, and therefore the number of back-arcs is at most 2n — 1 — (n — 1) = n. Second, as the depth- 
first-search chooses (among the available edges) the edge a with maximum x* in each iteration, 
we have that x* < x*^ for each a S B{v) where is the outgoing tree-arc of f G X. Moreover, 

as = min[x*, 1] for each back-arc, the number of back-arcs in B{v) is at least 
Combining these two facts gives us that 



f'(B(v)) 
min[x* {ty),l] 



E 

vex 



• f{B{v)) 
mm[x* (ty) , 1] 



< n. (3) 



For V £ I, we partition f'{B{v)) into 4 = min[2 - x*{t^), f {B{v))] and = f'{B{v)) - iy. 
Furthermore, let u* = Y^^^x With this notation we can upper bound J2vex iiiax[/'(i?(w)) — 1, 0] 
by 

^max[4 - 1,0] +n* (4) 
vex 

and relax Inequality ^ to 

y-^<n-u*. (5) 

The cost Q (where we ignore u*) subject to ([s]) can now be interpreted as a knapsack problem 
of capacity n — u* that is packed with an item of profit max[£„ — 1,0] and size £v/x*{ty) for each 
V £ I. Consequently, we can upper bound (|4]) by considering the fractional knapsack problem 
with capacity n — u* and infinitely many items of a maximized profit to size ratio. Associating a 
variable L with £y and T with x*{tjj) this ratio is maxo<T'<i,o<L<2-T ^^j^ ■ T- For any T the ratio 
is maximized by letting L = 2 — T and we can thus restrict our attention to items with profit to 
size ratio maxo<T<i " T. A simple analysis (see Appendix B.2) shows that the maximum is 
achieved when T = 2 — \/2. Therefore, the profit Q is upper bounded by 

^/2-l 



^/2 



(2 - \/2) • (n - u*) +u* = {yj2- if ■ (n - u*) + u* . 



As the fractional degree of a vertex v that is replaced by a gadget with a set Ij, of in- vertices is at 
least 2 + J2weXy have u* < 2{OPTip{G) — n). Hence, 



n 



@<{V2-lf -{n- 2{OPTlp{G) - n)) + 2{0PTlp{G) 
which equals (7 - 6\/2)n + 4(^/2 - 1)0PTlp{G). □ 
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Figure 4: The approximation ratios of Algorithm [T] and Christofides' algorithm depending on the 
ratio OPTLp{G)/n. 



Finally, by summing up the bounds given by Claim |5.3| and Claim |5.4| we bound the cost of / and 
hence c(C*) from above by OPTlp{G) - n + n(7 - 6^2) + 4(^2 - 1)0PTlp{G), which equals 
6(l-\/2)n + (4\/2-3)OPrLp(G). □ 

Having analyzed Algorithm [l} we are ready to prove our main algorithmic result. 



Theorem 1.1 (Restated) There is a polynomial time approximation algorithm for graph-TSP with 



performance guarantee ^^^^ 13 ^ 1-461. 



Proof. By Lemma 2.1 and the discussion before Algorithm[T| we can restrict ourselves to n-vertex 



graphs that are 2-vertex connected and have at most 2n — 1 edges. The statement now follows by 
using Algorithm [1] if OPTlp[G) is close to n and otherwise by using Christofides' algorithm. 

On the one hand, since Christofides' algorithm returns a solution with at most n — 1 + 
OPTlp{G)/2 edges (see [22] for an analysis of Christofides' algorithm in terms of OPTlp{G)), 
it has an approximation guarantee of at most 

n + OPTLpiG)/2 
OPTlp{G) • 

On the other hand, by Lemma [5. 2[ the approximation guarantee of Algorithm [T] is at most 

|n + I (6(1 - V2)n + (4^/2 - 2,)0PTlp{G)) 



OPTlp{G) 

In particular, the approximation guarantee of Algorithm [T] for a graph G with OPTlp{G) = n is 
4/3+2/3-(\/2-l)2 PS 1.4477 but deteriorates as OPTlp{G) increases. The approximation guarantee 
of Christofides' algorithm on the other hand is getting better and better as OPTlp{G) increases. 
Comparing these two ratios, one gets that the worst case happens when OPTlp{G) = ^^^~^^ n 
(see Figure |4| and, by using simple arithmetics, the approximation guarantee can be seen to be 

14(v^-l) □ 
12-^2-13 ■ 
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TSP integrality gap instance 



,» — • • • 
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• • • • 



TSPP integrality gap instance 



* • • • 



Figure 5: Graphs for which the Held-Karp relaxation and the Held-Karp relaxation adapted to 
graph-TSPP have an integrality gap tending to 4/3 and 1.5, respectively. 

6 The Traveling Salesman Path Problem 

In this section, we describe a sequence of generalizations and modifications of the techniques that 
we previously presented for graph-TSP and conclude with improved approximation algorithms for 
the traveling salesman path problem on graphic metrics, graph-TSPP. 

6.1 Using Held-Karp for Graph-TSPP 

We can obtain a natural generalization of LP{G) to graph-TSPP by distinguishing whether the 
end vertices s and t are in the same set of vertices. To this end, let $ = {5 C y | {s,t} C 
5 or n {s, t} = 0}. Then the relaxation can be written as 

LP{G,s,t): min^^Xe 

x{5{S))>2, 9^ScV,Se^ 
x{5{S))>l, 9^ScV,S^^ 
X > 0. 

We denote the optimum of this generalized linear program by OPT]^p{G, s, t). It is not hard to see 
that OPTlp{G) = OPTlp{G,s,s). 

The graph on the right-hand-side in Figure [5] has a fractional solution such that the integrality 
gap of LP{G,s,t) is lower bounded by 1.5. 

For a given graph G = {V, E), let G' = {V,EU {e'}) be the graph obtained from G by inserting 
e' = {s,t}. Note that, given any solution x to LP{G, s,t), we can obtain a feasible solution to 
LP{G') by adding 1 to x^,'- This way, for each of the cuts where 5 ^ $, we have 5{S) > 2 and thus 
OPTlp{G') < OPTlp{G, s,t) + 1. In the following, we will generalize our results for graph-TSP 
by using OPTip{G') — 1 as lower bound. 

Similar to graph-TSP, we observe that the difficulty in approximating graph-TSPP lies in 
approximating those instances that are 2-vertex connected. The proof of this lemma can be found 
in Appendix |B.l I 



Lemma 2.1 (Generalized) Let G be a graph and let A be an algorithm that, given a 2-vertex- 
connected subgraph H of G and s,t E V{H), returns a graph-TSPP solution to {H,s,t) with cost 
at most r ■ OPTlp{H, s,t). Then there is an algorithm A' that returns a graph-TSPP solution to 
{G,s,t) for any s,t € V^G) with cost at most r ■ OPTlp{G, s,t) . Furthermore, the running time 
of A' is a polynomial in the running time of A. 
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6.2 Generalization of the Approximation Framework to graph-TSPP 

We generalize the framework to the problem graph-TSPP. We obtain an approximation ratio that 



depends on dist{s,t), the distance of s and t. Therefore we can see the variant of Theorem 3.2 for 
graph-TSP as a special case where s and t have the distance 0. 



Theorem 3.2 (Generalized) Given a 2-vertex connected graph G = (y, E) with a removable pairing 



{R, P) and s,t G V , there is a polynomial time algorithm that returns a spanning subgraph H of G 

4| 



with an Eulerian path between s and t with at most — V\R\ + dist{s,t)/3 edges. 



Proof. A graph has an Eulerian path between s and t if and only if it is connected and the 
multigraph obtained by adding the edge e' = {s, t} is a spanning Eulerian subgraph. Therefore, we 



basically want to apply (the original) Theorem 3.2 and swap the degree of s and t 



To this end we create the graph G' = {V, E') from G by adding the edge e' to E if it is not 



already present in G. Then we apply Theorem 3.2 to G' with the removable pairing (R, P) to 
obtain the spanning Eulerian subgraph G. 

If the Eulerian graph G contains exactly one copy of e', we simply remove it to obtain H. This 
case appears if and only if e' was not chosen during the sampling, which happens with a probability 
of 2/3. Note that the 2-edge-connectedness ensures that the removal does not disconnect G. 

Otherwise, with probability 1/3, G contains either two copies of e' ii e' ^ R or none if e' G R. 
In either case we obtain H from G by removing all copies of e' and adding a shortest path of length 
exactly dist{s,t) to G. If e' £ E, we add a path with probability 1/3 and apart from that we only 
remove edges; the claimed result follows immediately. If e' ^ E, it is also not in R and thus the 
path is added if and only if two edges are removed. Furthermore, with probability 2/3, one edge is 
removed. Then the expected number of edges in H is 

4,,^, , 2,^, dist(s,t)-2 , 4,^, 2,^, dist(s,t) 
-( E + 1 - - R + 2/3 = - E - - R + 

Both the removal of e' and adding the shortest path swaps the parities of s and t, but of no 
other vertex. □ 



By using the generalized Theorem 3.2 within the proof of Lemma 4.1, we obtain immediately 
the following generalization. 



Lemma 4.1 (Generalized) Given a 2-vertex connected graph G, two vertices s,t in G, and a depth 
first search tree T of G, let C* be the minimum cost circulation to C{G,T) of cost c(C*). Then 
there is a spanning multigraph H of G that has an Eulerian path between s and t with at most 



|n + |c(C*) - 2/3 + dist{s, t) /3 edges. 
6.3 Approximation Algorithms for Graph-TSPP 

We are now equipped with the right tools to obtain algorithmic results for graph-TSPP. 



Theorem 1.2 (Restated) For any e > 0, there is a polynomial time approximation algorithm for 
graph-TSPP with performance guarantee 3 — \/2 -\- e < 1.586 -|- e. 

If furthermore each block of the given graph is degree three bounded, there is a polynomial time 
approximation algorithm for graph-TSPP with performance guarantee 1.5 -|- e, for any e > 0. 
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Proof. By the generalized variant of Lemma 2.1 , it is sufficient to show the theorem assuming that 
G is 2-vertex connected. 



If G is degree three bounded, we apply Lemma 5.1 on G, but use the generalized version of 



Lemma 4.1 to obtain a solution to graph-TSPP that has at most An/3 — 2/3 + dist{s,t)/3 edges. 
Additionally we may replace dist{s,t) by n/2, since in 2-vertex-connected graphs with more than 
two vertices there are two vertex-disjoint paths between s and t. 

To obtain the claimed approximation ratio, we use the trivial lower bound n— 1 of OPTlp{G, s, t). 
For any e, we determine a constant uq such that, for all n > no, the approximation ratio is bounded 
from above by 1.5 + e. If the graph has fewer than uq vertices, we compute an optimal solution in 
constant time. 

We continue with the case of general unweighted graphs. As in the previous subsections, e' = 
{s,t}. We apply Algorithm [T]^ to obtain a circulation C* of G' = {V,E{J {e'}) such that, by 
c{G'*) < 6(1 - V2)n + (4\/2 - 3)OPTlp{G'). Using this circulation, we apply the 



5.2 



Lemma 

generalized version of Lemma 4.1 However, \i e' ^ E and it is used in the solution (i.e., it was 



added as a shortest path), we have to replace e' by a shortest path between s and t in G. This 



is equivalent to using dist{s,t) from G instead of G' in Lemma 4.1 Therefore, in the following 



dist{s, t) always refers to the distance in G and we obtain a solution to graph-TSPP of at most 

+ ^(6(1 - V2)n + {4^2 - 3)OPnp{G')) - ? + ^^^^ 
= (16/3-4\/2)n-Fdist(s,t)/3 + (8\/2/3-2)(OPrLp(G')) -2/3 

edges. 

In the following, let d = dist{s,t)/n and C = {OPTlp{G') — l)/n. Then, using the lower bound 
OPTlp{G') — 1 on OPTlp{G, s,t), the approximation ratio achieved by our algorithm is at most 

'^^^-^ + "■'^ 8^2/3-2 + ... (0) 

where C > 1 - 1/n and ei = (8\/2/3 - ^/3)/0PTlp{G'). In the following calculations, we omit ei, 
since it decreases with the input size. Similarly, we assume C ^ 1- We will consider the deviation, 
however, in the final result. 

Since Q depends on C,, similar to the case of graph-TSP we employ a second algorithm to 
obtain an upper bound independent of C,- 

Let A be the following simple approximation algorithm for graph-TSPP which can be considered 
folklore. First, A computes a spanning tree T of cost n — 1 in G. Then A doubles all edges but 
those on the unique path between s and t in T. 

The output of A is clearly a valid solution to graph-TSPP and it computes a solution of at most 
2 • (n — 1) — dist{s, t) edges. Similar to ([6]), this results in an approximation ratio of at most 

(2 - d)/C. (7) 



Note that for C = 1 d = \/2 — 1, disregarding e, ([7| is the approximation ratio we are aiming 
for. Any increase of (" or d can only improve this ratio. Therefore we may restrict the analysis to 
values of d in the range [0, \/2 — 1]. 

We will first analyze the approximation ratio depending on d and determine afterwards the 
value of d where the minimum of the two approximation ratios is maximized. 

For any fixed d within the considered range, ^ is monotonically increasing with respect to C, 
whereas ^ is monotonically decreasing. Since we are interested in the minimum of the ratios, in 
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Figure 6: The minimum of the approximation ratios (|6| and ([7]) depending on d and 



the worst case both ratios are equal. This happens when 

8^2-6 • 

We now replace C by Q in ([T]) to obtain the worst case approximation ratio depending on d 

8^/2 - 6 - idV2 + 3d 
6^/2 -2d -5 

Since this ratio can be seen to be monotonically increasing with respect to d within the considered 
range, the worst case appears when d = \pi — 1, and thus we obtain as upper bound on the 
approximation ratio 

8V^- 6 -4(^/2- 1)^/2 + 3(\/2 - 1) _ 15^/2-17 _^ ^ 
6^/2 - 2(^/2 - 1) - 5 ~ 4^/2-3 

To conclude the proof, we still have to consider e\ and the case where C < 1- For any e > 0, 
we determine an no based on e\ and C similar to the degree bounded case and solve graph-TSPP 
on graphs with fewer than no vertices exactly. Altogether, we obtain an approximation ratio of at 
most 

3 - \/2 + e 

(see also Figure [6]) . □ 



7 Conclusions 

We have introduced a framework of removable pairings to find Eulerian multigraphs. This frame- 
work proved to be useful to obtain an approximation algorithm for graph-TSP with an approxi- 
mation ratio smaller than 1.461 and to obtain a tight upper bound on the integrality gap of the 
Held-Karp relaxation for a restricted class of graphs that contains degree three bounded and claw- 
free graphs. In particular, we showed that in subcubic 2-vertex-connected graphs we can always 
find a solution to graph-TSP of at most 4n/3 — 2/3 edges, which settles a conjecture from |3] 
affirmatively. 
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Our framework is not restricted to graph-TSP. With the same techniques and a more detailed 
analysis, our result translates to the traveling salesman path problem on graphic metrics with 
prespecified start and end vertex. In this way, one is guaranteed to obtain an approximation ratio 
smaller than 1.586 and, for the degree three bounded case, the approximation ratio gets arbitrarily 
close to 1.5. 

We note that the framework of removable pairings is straightforward to generalize to general 
metrics, but the problem of finding a large enough removable pairing in such graphs in order to 
improve on Christofides' algorithm remains open. 
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A 



Example of Circulation Network 

Depth first search tree T of G 



Circulation network C{G,T) 




Figure 7: The circulation network C{G,T) of a graph G with depth- first-tree T. In- vertices and 
out-vertices of the circulation network is depicted in white and black, respectively. 



B Omitted Proofs 
B.l Proof of Lemma 12.11 

We prove the more general lemma from Section [6] that also applies to the traveling salesman path 
problem. 



Lemma 2.1 (Restated) Let G be a graph and let A be an algorithm that, given a 2-vertex- connected 
subgraph H of G and s,t £ V{H), returns a graph-TSPP solution to {H,s,t) with cost at most 
r ■ OPTlp{H, s,t). Then there is an algorithm A' that returns a graph-TSPP solution to {G,s,t) 
for any s, t € V{G) with cost at most r ■ OPTLp(^G,s,t). Furthermore, the running time of A' is a 
polynomial in the running time of A. 

Proof. We define an r-approximation algorithm A' for G as follows: 
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1. If G is 2- vertex connected then return the graph-TSPP solution obtained by running A on 
{G,s,t). 

2. Otherwise, let f be a cut vertex whose removal results in components Ci,C2, ■ ■ ■ ,Ci with 
/ > 1. Recursively run A' on the / sub-instances {Gi, si,ti), . . . , (Gi, si,ti) and return the 
union of the obtained solutions, where Gj denotes the subgraph of G induced by Ci U {v}, 

fs ifsGCi , ft if ted 

Si = < _ and ti = < 

\v otherwise \v otherwise 

As a vertex is selected to be a cut vertex at most once, A' terminates in time bounded by a 
polynomial in the running time of A. It remains to verify that it returns a graph-TSPP solution to 
{G, s, t) with cost at most r-OPTip{G, s, t). We do so by induction on the depth of the recursion. In 
the base case no recursive calls are made so the solution is that returned by A which by assumption 
is a graph-TSPP solution to {G, s,t) with cost at most r ■ OPTlp{G, s,t). 

Now consider the inductive step when a cut vertex t; of G is selected whose removal results in 
components Gi, G2, . . . , G; with I > 1. Let Ei be the multiset of edges of the obtained graph-TSPP 
solution to {Gi, Sj, U). With this notation the edge set returned by A' is IJi=i need to 

prove that 

(a) it is a feasible graph-TSPP solution to {G,s,t), i.e, the edge set Ui=i-^i ^ {s,t} forms a 
spanning Eulerian subgraph; and 

(b) T,U\^i\<r-OPTLpiG,s,t). 

We start by proving (a). By the induction hypothesis, the edge set EiU{si, U} forms a spanning 
Eulerian subgraph of Gi and, consequently, [J^^i {Ei U {si,ti}) forms a spanning Eulerian subgraph 
ofG. ThatU-=i-^*U{s,t} is a spanning Eulerian subgraph of G now follows from that the endpoints 
of {si,ti}, {s2, ^2}, • • • , {si, ti} can be partitioned so that one is s, one is t and the remaining 2{i— 1) 
endpoints are v ( possibly not different from s and t). 

We proceed by proving (b). By the induction hypothesis, Yli=i ^ ''"'Yli=i OPTi,p{Gi, Si, U) 
and it is thus sufficient to prove ^1^=1 OPT^piGi, Si,ti) < OPTlp{G, s, t). To this end. Let x be an 
optimal solution to LP{G, s, t) and let denote its restriction to the subgraph Gj with start vertex 
Si and end vertex tj. By the definition of Gi,Si,ti and the fact that is a cut vertex, it is easy 
to see that each constraint in LP{Gi,Si,ti) has an identical constraint in LP{G,s,t). Therefore, 
X* corresponds to a solution to LP{Gi, Si,ti) and hence OPTlp{G, s,t) > Yli=iOPTLp{Gi, Si,ti), 
which completes the inductive step and the proof of the lemma. □ 



B.2 McLximum Profit to Size Ratio 

We verify that maxo<T<i is obtained when T = 2 — \/2. Let f{T) = ^^z^T = ^zTp ~ i=T ^^"^ 

consider its first derivative 

d _ 1 T f 2T ^2 \ ;l - 2T T - r2 

^/( ) - + (2 - r)2 ~ ^ (2 - r)2 J ~ ^ (2 - r)2 ' 

Prom this it follows that ^f{T) = when 

(1 - 2r)(2 - T) r - = <^ - 4T + 2 = T = 2 ± \/2. 

It is now easy to verify that the unique maximum of f{T) for < T < 1 is obtained when 
T = 2-V2. 
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